AITopics | machine-generated content

machine-generated content

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis

Li, Siyuan, Wulianghai, Aodu, Lin, Xi, Li, Guangyan, Chen, Xiang, Wu, Jun, Li, Jianhua

arXiv.org Artificial Intelligence, Oct-15-2025

With the increasing integration of large language models (LLMs) into open-domain writing, detecting machine-generated text has become a critical task for ensuring content authenticity and trust. Existing approaches rely on statistical discrepancies or model-specific heuristics to distinguish between LLM-generated and human-written text. However, these methods struggle in real-world scenarios due to limited generalization, vulnerability to paraphrasing, and lack of explainability, particularly when facing stylistic diversity or hybrid human-AI authorship. In this work, we propose StyleDecipher, a robust and explainable detection framework that revisits LLM-generated text detection using combined feature extractors to quantify stylistic differences. By jointly modeling discrete stylistic indicators and continuous stylistic representations derived from semantic embeddings, StyleDecipher captures distinctive style-level divergences between human and LLM outputs within a unified representation space. This framework enables accurate, explainable, and domain-agnostic detection without requiring access to model internals or labeled segments. Extensive experiments across five diverse domains, including news, code, essays, reviews, and academic abstracts, demonstrate that StyleDecipher consistently achieves state-of-the-art in-domain accuracy. Moreover, in cross-domain evaluations, it surpasses existing baselines by up to 36.30%, while maintaining robustness against adversarial perturbations and mixed human-AI content. Further qualitative and quantitative analysis confirms that stylistic signals provide explainable evidence for distinguishing machine-generated text. Our source code can be accessed at https://github.com/SiyuanLi00/StyleDecipher.
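As a rough illustration of the idea described in the abstract above (joining discrete stylistic indicators with a continuous embedding in one feature vector), here is a minimal Python sketch. It is not the StyleDecipher implementation: the function names `discrete_style_features` and `unified_representation`, the three indicators chosen, and the caller-supplied embedding are all assumptions made for illustration.

```python
import re
from typing import List, Sequence


def discrete_style_features(text: str) -> List[float]:
    """Compute three simple, hand-picked stylistic indicators (illustrative only)."""
    # Split on runs of sentence-ending punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    n_words = len(words)

    avg_sentence_len = n_words / len(sentences) if sentences else 0.0
    avg_word_len = sum(len(w) for w in words) / n_words if n_words else 0.0
    comma_rate = text.count(",") / n_words if n_words else 0.0
    return [avg_sentence_len, avg_word_len, comma_rate]


def unified_representation(text: str, embedding: Sequence[float]) -> List[float]:
    """Concatenate discrete indicators with a precomputed embedding.

    `embedding` is assumed to come from some external sentence-embedding
    model; this sketch does not compute it.
    """
    return discrete_style_features(text) + list(embedding)


sample = "Hello world. This is fine, right?"
vector = unified_representation(sample, [0.1, 0.2])
print(len(vector))
```

A downstream classifier would then be trained on such vectors; which classifier and which features the paper actually uses is not stated in the abstract.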

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2510.12608

Country: Asia > China (0.46)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Evaluating Machine Expertise: How Graduate Students Develop Frameworks for Assessing GenAI Content

Chen, Celia, Leitch, Alex

arXiv.org Artificial Intelligence, Apr-28-2025

This paper examines how graduate students develop frameworks for evaluating machine-generated expertise in web-based interactions with large language models (LLMs). Through a qualitative study combining surveys, LLM interaction transcripts, and in-depth interviews with 14 graduate students, we identify patterns in how these emerging professionals assess and engage with AI-generated content. Our findings reveal that students construct evaluation frameworks shaped by three main factors: professional identity, verification capabilities, and system navigation experience. Rather than uniformly accepting or rejecting LLM outputs, students protect domains central to their professional identities while delegating others--with managers preserving conceptual work, designers safeguarding creative processes, and programmers maintaining control over core technical expertise. These evaluation frameworks are further influenced by students' ability to verify different types of content and their experience navigating complex systems. This research contributes to web science by highlighting emerging human-genAI interaction patterns and suggesting how platforms might better support users in developing effective frameworks for evaluating machine-generated expertise signals in AI-mediated web environments.

large language model, machine-generated content, natural language, (16 more...)

arXiv.org Artificial Intelligence

2504.17964

Country: North America > United States > Maryland > Prince George's County > College Park (0.15)

Genre: Research Report > New Finding (0.89)

Industry: Education > Educational Setting > Higher Education (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

Huang, Liting, Zhang, Zhihao, Zhang, Yiran, Zhou, Xiyue, Wang, Shoujin

arXiv.org Artificial Intelligence, Jun-7-2024

Recent advancements in generative AI models, which can create realistic and human-like content, are significantly transforming how people communicate, create, and work. While the appropriate use of generative AI models can benefit society, their misuse poses significant threats to data reliability and authentication. However, due to a lack of aligned multimodal datasets, effective and robust methods for detecting machine-generated content are still in the early stages of development. In this paper, we introduce RU-AI, a new large-scale multimodal dataset designed for the robust and efficient detection of machine-generated content in text, image, and voice. Our dataset is constructed from three large publicly available datasets (Flickr8K, COCO, and Places205) by combining the original datasets with their corresponding machine-generated pairs. Additionally, experimental results show that our proposed unified model, which incorporates a multimodal embedding module with a multilayer perceptron network, can effectively determine the origin of the data (i.e., original data samples or machine-generated ones) from RU-AI. However, future work is still required to address the remaining challenges posed by RU-AI. The source code and dataset are available at https://github.com/ZhihaoZhang97/RU-AI.

arxiv, dataset, modality, (13 more...)

arXiv.org Artificial Intelligence

2406.04906

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Idaho > Ada County > Boise (0.05)
(12 more...)

Genre: Research Report (0.70)

Industry:

Health & Medicine (0.46)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.70)

MUGC: Machine Generated versus User Generated Content Detection

Xie, Yaqi, Rawal, Anjali, Cen, Yujing, Zhao, Dixuan, Narang, Sunil K, Sushmita, Shanu

arXiv.org Artificial Intelligence, Mar-28-2024

As advanced modern systems like deep neural networks (DNNs) and generative AI continue to enhance their capabilities in producing convincing and realistic content, the need to distinguish between user-generated and machine-generated content is becoming increasingly evident. In this research, we undertake a comparative evaluation of eight traditional machine-learning algorithms to distinguish between machine-generated and human-generated data across three diverse datasets: Poems, Abstracts, and Essays. Our results indicate that traditional methods demonstrate a high level of accuracy in identifying machine-generated data, reflecting the documented effectiveness of popular pre-trained models like RoBERTa. We note that machine-generated texts tend to be shorter and exhibit less word variety compared to human-generated content. While specific domain-related keywords commonly utilized by humans, albeit disregarded by current LLMs (Large Language Models), may contribute to this high detection accuracy, we show that deeper word representations like word2vec can capture subtle semantic variances. Furthermore, readability, bias, moral, and affect comparisons reveal a discernible contrast between machine-generated and human-generated content. There are variations in expression styles and potentially underlying biases in the data sources (human and machine-generated). This study provides valuable insights into the advancing capacities and challenges associated with machine-generated content across various domains.
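The abstract above observes that machine-generated texts tend to be shorter and show less word variety. One common way to quantify those two properties is token count and type-token ratio (distinct words divided by total words). The sketch below is an assumption about how such features could be computed, not the paper's own feature extraction; the function name `lexical_stats` and the simple regex tokenizer are hypothetical.

```python
import re
from typing import Dict, Union


def lexical_stats(text: str) -> Dict[str, Union[int, float]]:
    """Return token count, distinct-word count, and type-token ratio."""
    # Lowercase and keep alphabetic runs (plus apostrophes) as tokens.
    tokens = re.findall(r"[a-z']+", text.lower())
    n_tokens = len(tokens)
    n_types = len(set(tokens))
    # Guard against division by zero for empty input.
    ttr = n_types / n_tokens if n_tokens else 0.0
    return {"tokens": n_tokens, "types": n_types, "ttr": ttr}


stats = lexical_stats("The cat and the dog")
print(stats["tokens"], stats["types"], stats["ttr"])
```

Type-token ratio generally falls as texts get longer, so comparing it across texts of very different lengths can mislead; a length-normalized variant may be preferable in practice.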

dataset, machine generated, user generated content detection, (11 more...)

arXiv.org Artificial Intelligence

2403.19725

Country:

North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.68)
Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

From SEO To GEO: What GPT Marketers Need to Know

#artificialintelligence, Mar-20-2023, 21:20:51 GMT

If you are 25 or younger, chances are high that you have never encountered the paper version of the Yellow Pages, but throughout the 20th century, print directories were among the primary ways for consumers and businesses to connect. Established in 1886, Yellow Pages posted its final print issue in January 2019, closing the chapter on 130-plus years of print directory marketing. In the late 1990s, the exotic new profession of online directory marketing emerged with the rise of Yahoo! and other online directories, and quickly disappeared as search engines took over. Search engine, to be precise, since Google quickly took the lion's share of the market in the early 2000s. Since then, every business has been bombarded by armies of search engine optimization (SEO) marketers offering to analyze and optimize its websites and social networks, along with all kinds of tricks designed to get the business to the top of search results.

engine, marketer, wikipedia, (16 more...)

#artificialintelligence

Industry: Information Technology > Services (0.36)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)

Three Ways To Use AI And Machine Learning To Create Customers For Life

#artificialintelligence, Jan-2-2019, 19:20:31 GMT

Only a few years ago, business to business (B2B) technology companies would sell a solution to a customer, wait three to five years, then reapproach that same customer to offer a renewal or a completely new product. But these days, the initial purchase doesn't necessarily translate into a continuum of sales, and it doesn't hold the promise of customer retention like it once did. That's because today's customers have much higher expectations in order to remain loyal. Getting them to love your brand and love your products takes a customer-first mindset and a company-wide commitment to improve the customer experience. Companies that excel at customer experience are using artificial intelligence (AI) and machine learning heavily to produce immersive, authentic experiences across every customer touch point.

artificial intelligence, customer, machine learning, (11 more...)

#artificialintelligence

Industry:

Information Technology (0.61)
Media > Film (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.64)

How AI is Impacting Content Marketing

#artificialintelligence, Jul-14-2017, 13:07:34 GMT

While there are plenty of dire-sounding discussions taking place these days around artificial intelligence (AI) and machine learning--and their potential to disrupt the world as we know it--this isn't technology of the future. New technologies are promising to upend the traditional ways in which content is conceived, produced, and disseminated. A Copyblogger article from as far back as 2015 noted that both Forbes and the Associated Press were producing machine-generated content. These examples are likely to both thrill and chill content marketers, depending on where they're perched along the content creation continuum--including the need to generate an increasing volume of content and to make a living from creating that content. For now, though, there is fortunately less to fear than there is to cheer, says Natalia Markova, senior web content strategist with Jellyfish, a global digital agency.

artificial intelligence, machine learning, natural language, (13 more...)

#artificialintelligence

Industry:

Information Technology (0.31)
Media (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.98)
Information Technology > Artificial Intelligence > Machine Learning (0.96)
